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Conventional  /  Classic  Incremental 
Sampling  Design  Paradigm  (Strategy) 

•  Collect  n  =  1  incremental  sample  (IS)  for  each 
Sampling  Unit  (SU)  &  field  triplicates  for  10%  of  SUs. 

•  If  %RSD  <  30%,  results  fully  usable.  Otherwise,  flag 
results  (e.g.,  as  estimated). 

•  %RSDs  of  SUs  with  triplicates  “represent”  precision 
of  SUs  for  which  n  =  1 . 

•  Decisions  based  on  one  IS  result  acceptable  (e.g., 
maximum  of  SU  is  compared  with  a  decision  limit). 
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“Conventional”  Approach 


•  Advantages:  Cost  effective  and  simple  to 
understand  and  implement. 


•  Disadvantage:  Data  of  unknown  quality; 
decision  errors  are  not  controlled. 

>  False  positive  (FP)  error,  a 


(e.g.,  wrongly  concluding 


site  cone.  >  background/action  level) 


>  False  negative  (FN)  error,  /? 

(e.g.,  wrongly  concluding 

site  cone.  <  background/action  level) 
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Example  of  Classic  Approach 


•  IS  Field  triplicates: 

>  4.2 

>  2.0 
>11.2 

Sample  Statistics 

x  =  5.8,  s  =  4.8 

%RSD  =  (s  /  x)  x  100 
=  83%  »  30% 


Probability  Plot  of  Field  Triplicates 

Normal 


Mean 

5.774 

StDev 

4.813 

N 

3 

RJ 

0.956 

P*Vakje 

>0.100 

s  =  sample  standard  deviation 
x  =  sample  mean 


© 


4 


BUILDING  STRONG, 


Re-sampling  of  Triplicates 


•  Field  triplicates: 

>  0.82 

>  0.67 

>  0.78 


Sample  Statistics 

x  =  0.76,  s  =  0.077 
%RSD  =  10%  <  30% 


Probability  Plot  of  Field  Triplicates 

Normal 


X 


Mean 

0.7573 

StDev 

0.076-66 

N 

3 

RJ 

0.968 

P-Vakje 

>0.100 

<§> 
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•  The  two  sets  of  triplicates  were  actually  randomly 
selected  from  a  lognormal  distribution. 

Population  Parameters 

p  «  4.5,  a  «  5.9 
%RSD  =  130% 

Sample 

%RSD  x  95%UCL 

83%  5 .8  14 

10%  0.76  0.89 


•  Distribution  cannot  be  reliably 
determined  when  n  =  3. 
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Histogram  of  Lognormal  Distribution 


100,000  points 


Him 
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ITRC  Guidance  Incremental  Sampling 

Methodology  (ISM)  * 

...the  RSD  does  not  provide  an  indication  of  the 
accuracy  of  the  estimate  of  the  mean  or  95%  UCL... 


http://www.itrcweb.org/ISM-1/Executive  Summarv.html 


*  ISM-1,  Interstate  Technology  Regulatory  Council, 
February  2012 
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Huh' 
If  only 
it  were 
that 
simple 


ITRC  ISM  Guidance 


When  a  site  includes  many  [Decision  Units]  DUs,  it  may  be 
tempting  to  extrapolate  the  estimate  of  the  variance  (or  the  CV) 
[RSD]  from  one  DU  to  another .. . 

...Statistical  theory  suggests  that  we  can  expect  the  estimated  mean 
and  SD  [standard  deviation]  to  be  independent  for  normal 
distributions  but  positively  correlated  for  positively  skewed 
distributions  ...If  ...independent,  ...high  estimated  means  may  have 
low  SDs  and  vice  versa... By  contrast,  if  the  parameters  are 
correlated  because  of  some  asymmetry  in  the  distribution... it  would 
be  preferable  to  extrapolate  the  average  CV...A  priori  knowledge 
about  the  distribution  shape  is  unlikely... 

Translation:  Don’t  extrapolate  results; 

it  doesn’t  work! 
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Comparing  the  maximum 
with  an  Action  Level  (AL) 


12 
11 
10 
9 

c  8 

7 
6 
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4 

0  100  200  300  400 

%RSD 


n  vs  °/oRSD 


Assumes  Lognormal  Distribution 

%RSD  =  100%,  n*  7 


ln(a) 


\n{(j){  Z  <  0.5  x  J ln[(%RSD/100)2  +  1]  ) 


*'n(a,%RSD)  = 


n  =  number  of  samples  needed  to  be  95% 
confident  (a  =  0.05)  the  mean  (p)  will  be 
less  than  the  largest  detected  concentration. 
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Revised/Proposed  Paradigm 

•  Use  Data  Quality  Objective  (DQO)  or  Technical 
Project  Planning  (TPP)  Process. 

•  Determine  replicates  n  for  each  SU  using  statistical 
approach  (e.g.,  Visual  Sampling  Plan). 

r - \ 

n  t  -»  a ,  p,  (A  /a)  i 

l _ _ _ 


a  =  FP error,  =  FN error  (e.g.,  a,  [5  =  0.1  —  0.01 ) 

A  =  Change  important  to  detect  (e.g.,  A  =AL-p) 


<7  =  Standard  deviation  (Note:  %RSD  oc  a) 

fff] 
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•  Use  hypothesis  tests  to  compare  site 
means/medians  with  Action  Level  (AL)  or 
background  (BG)  (e.g.,  using  ProUCL). 

>  USACE  Engineer  Manual  (EM)  1 1 10-1-1014, 
Environmental  Statistics  (Jan  2008). 

>  EPA  QA/G-9S,  Data  Quality  Assessment:  Statistical 
Methods  for  Practitioners  (Feb  2006). 


-  Evcru thlvi0  tjou  wanted  to  fellow 
about  hypothesis  tests  b  ut  we  re 
afraid  to  aste. 

STATISTICS 

i'or  the 

Utterly 

Con-fused 

:  -inf  <ffn 

ATJI..  «  V' 

Y*ralor  oi 

11 

«rd  til.  -  o 

calculator 

Alternatives  to  Hypothesis  Tests 

Compare  upper  confidence  limit  (UCL)  of  mean 
with  AL;  conclude  site  is  “clean”  if  UCL  <  AL. 


Student’s  t  95%UCL  —  .v  +  t„  1,0.95  x  s /  V/7 

«  x  +  (2-3)  x  s  /  4n 


( x  is  normal,  n>  3) 


Chebyshev  95%  UCL  =  j 


+ 


1 


0.05 


-1 


x  s  /  4n 


J 


V 

\ 


x  +  4  s  /  yfn 


(x  is  not  normal) 


•  11 
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•  Calculate  (1-a)100%  (e.g.,  95%)  BG  upper 
prediction  limit  (UPL)  to  control  FPs. 

SUs  are  consistent  with  BG  if: 

(Normality  assumed) 

xi  <  (1-#)100%  UPL5G  =  xBG  +  n-iJ  l  sbg 

V  m  n 


a  Site- wide  FP  error 

n  Number  of  background  samples  (n  >  2) 

k  Number  of  site  SUs  (k  >  1) 

m  Number  of  (independent)  replicates  per  SU  (m  >  1) 

xt  Mean  of  m  replicates  for  zth  SU,  i  =  1,  2,. . k 

xBG  BG  mean 

sBG  BG  standard  deviation 


& 
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BG  sample  maximum  =  y  100%  UPL  (Non-parametric) 

y  =  y  (n  BG  samples,  k  SUs,  m  replicates  per  SU) 


f 


r= 


n 


\ 


v 


km  +  n 


C 


m 


< 


m—1 


J 


c 

v_-'j 


'km+n- 1 
km+n— 1— k 


+  Cf 


c 


m 


\k— 1 


m-1 


C 


/vW+/?-(A:-l) 


+  C 


c 


i/c-2 


m-1. 


c 

V_-'i 


km+n- 1 
km+n-{k-  2) 


+...+  C 


C: 


m-1 


^“1  z^km + n  —  1 


+1 


c 

V-'j 


km+n- 2 


y  =  Probability  at  least  m- 1  out  of  m 
replicates  will  be  less  than  the  maximum  of  72 
BG  samples  for  each  of  the  k  SUs. 


Gibbons,  R.  D,  Some  Additional  Nonparametric 
Prediction  Limits  for  Ground-Water  Detection 
Monitoring  at  Waste  Disposal  Facilities, 
Groundwater,  Vol.  29,  No.  5  (Sep  -  Oct  1991) 


y  >  0.9 
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Conventional  Versus 
Proposed  Approach 


Approach 

Conventional 

Proposed 

%RSD  measures  data  quality 

X 

Extrapolating  %RSDs 

✓ 

X 

Decisions  based  one  sample 

✓ 

X 

Statistical  sampling  design 

X 

V 

s  Potentially  acceptable 
*  Potentially  unacceptable 
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